
@Jasmine-Yuting-Zhang
Collaborator

This PR introduces support for the PatchTSMixer model in the Plato federated learning framework for time series forecasting tasks.

Description

Specifically, this PR:

  • Adds ETT.py to support the Electricity Transformer Temperature (ETT) dataset, including data loading, preprocessing, and federated partitioning logic.
  • Integrates the HuggingFace PatchTSMixer model architecture for time series forecasting within Plato (see the sketch after this list).
  • Adds TOML configuration files for PatchTSMixer experiments under configs/TimeSeries/.
  • Adds mean squared error (MSE)-based evaluation for PatchTSMixer experiments.
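
For reference, here is a minimal sketch (not the code added by this PR) of how the PatchTSMixer prediction model can be instantiated from the HuggingFace transformers library for ETT-style multivariate data; the hyperparameter values below are illustrative only and may differ from those in the TOML configs:

```python
import torch
from transformers import PatchTSMixerConfig, PatchTSMixerForPrediction

# Illustrative settings for ETT-style multivariate data (7 channels);
# the actual values used by this PR's configs may differ.
config = PatchTSMixerConfig(
    context_length=512,      # length of the input window
    prediction_length=96,    # forecast horizon
    num_input_channels=7,    # ETT provides 7 variables
    patch_length=16,
    patch_stride=16,
)
model = PatchTSMixerForPrediction(config)

# past_values:   (batch, context_length, num_input_channels)
# future_values: (batch, prediction_length, num_input_channels)
past_values = torch.randn(4, 512, 7)
future_values = torch.randn(4, 96, 7)

outputs = model(past_values=past_values, future_values=future_values)
print(outputs.loss)                      # MSE loss against future_values
print(outputs.prediction_outputs.shape)  # torch.Size([4, 96, 7])
```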

How has this been tested?

Quick check evaluation:

uv run python plato.py --config configs/TimeSeries/patchtsmixer_custom.toml

This configuration runs only 3 rounds, which is useful for quick functional tests and CORE-style checks. The run completed successfully without runtime errors.

Longer training run:

uv run python plato.py --config configs/TimeSeries/patchtsmixer_large.toml

This configuration runs many more rounds. After 400 rounds, the MSE dropped from 7.14 to around 1.30, indicating that the model and the data pipeline are working as expected.
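
Assuming the reported MSE is the standard per-element mean squared error averaged over the forecast horizon and all channels, a minimal sketch of such an evaluation looks like the following (the function name and shapes are illustrative, not taken from the PR):

```python
import torch

def forecast_mse(predictions: torch.Tensor, targets: torch.Tensor) -> float:
    """Mean squared error over (batch, prediction_length, num_channels) tensors."""
    return torch.mean((predictions - targets) ** 2).item()

# Example with dummy forecasts and ground-truth future values.
preds = torch.randn(4, 96, 7)
truth = torch.randn(4, 96, 7)
print(forecast_mse(preds, truth))
```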

Types of changes

  • Bug fix (non-breaking change which fixes an issue) Fixes #
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

  • My code has been formatted using the Ruff formatter (ruff format) and checked using the Ruff linter (ruff check --fix).
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.

baochunli and others added 30 commits October 28, 2025 08:51
- Resolved a RuntimeError caused by non-contiguous tensors during view operations in nanochat's gpt.py ("view size is not compatible with input tensor's size and stride..."): replaced .view() with .reshape() (see the sketch after this commit list).
- Resolved an issue where the configuration requested 'train_loss' in the results, but the server's get_logged_items() did not include it.
- Avoided a vocabulary size mismatch between the model and the tokenizer during CORE evaluation.
- Updated the log message from "global accuracy" to "Average Centered CORE benchmark metric".
- Used Ruff to format the code.
- Added instructions for initializing submodules and resolving maturin build failure.
- Included configurations for both pre-trained and custom modes.
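
Regarding the .view()/.reshape() commit above: the underlying PyTorch behavior is that .view() requires a contiguous memory layout, whereas .reshape() falls back to a copy when needed. A minimal illustration (not the actual nanochat code):

```python
import torch

x = torch.randn(2, 3, 4).transpose(1, 2)  # transposing makes the tensor non-contiguous

# x.view(2, 12) would raise:
#   RuntimeError: view size is not compatible with input tensor's size and stride ...
y = x.reshape(2, 12)  # .reshape() copies when the layout requires it
# equivalent alternative: x.contiguous().view(2, 12)
print(y.shape)  # torch.Size([2, 12])
```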

netlify bot commented Dec 1, 2025

Deploy Preview for platodocs canceled.

Latest commit: 20ab574
Latest deploy log: https://app.netlify.com/projects/platodocs/deploys/692f507818245f0008203aed
